On scattered subword complexity

نویسنده

  • Zoltán Kása
چکیده

Sequences of characters called words or strings are widely studied in combinatorics, and used in various fields of sciences (e.g. chemistry, physics, social sciences, biology [2, 3, 4, 11] etc.). The elements of a word are called letters. A contiguous part of a word (obtained by erasing a prefix or/and a suffix) is a subword or factor. If we erase arbitrary letters from a word, what is obtained is a scattered subword. Special scattered subwords, in which the consecutive letters are at distance at most d (d ≥ 1) in the original word, are called d-subwords [7, 8]. In [9] the super -d-subword is defined, in which case the distances are of length at least d. The super-d-complexity, as the number of such subwords, is computed for rainbow words (words with pairwise different letters). In this paper we define special scattered subwords, for which the distance in the original word of length n between two letters which will be consecutive in the subword, is taken from a subset of {1, 2, . . . , n − 1}. The complexity of a word is defined as the number of all its different subwords. Similar definitions are for d-complexity, super-d-complexity and scattered subword complexity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scattered subwords and composition of natural numbers

Special scattered subwords in which the length of the gaps are bounded by two natural numbers are considered. For rainbow words the number of such scattered subwords is equal to the number of special restricted compositions of natural numbers in which the components are natural numbers from a given interval. Linear algorithms to compute such numbers are given. We also introduce the concepts of ...

متن کامل

On Inequalities between Subword Histories

By taking out letters from a word we get a subword. Both continuous sub-words (also called factors or simply subwords) and scattered subwords were extensively studied. In [4] the authors introduced Parikh matrices, structures that contain more information about the words than Parikh vectors, which tell us only the number of di $ffere\tau 1T$ letters building the word. $I_{l1}[5]$ the notion of ...

متن کامل

Absoluteness of subword inequality is undecidable

In a given word, one can count the number of occurrences of other words as a scattered subword. These counts can be “added” and/or “multiplied.” A subword history gives an instruction of what words to be counted and how these counts to be added and multiplied with other counts or integer constants, and hence, determines its unique value in a given word. Mateescu, Salomaa, and Yu asked: “is it d...

متن کامل

Subword histories and associated matrices

The basic numerical quantity investigated in this paper is |w|u, the number of occurrences of a word u as a scattered subword of a word w. Arithmetical combinations of such quantities yield a so-called subword history. We investigate the information content of subword histories. Reducing subword histories to linear ones, as well as the recently introduced Parikh matrices, will be important tool...

متن کامل

Subword complexity and power avoidance

We begin a systematic study of the relations between subword complexity of infinite words and their power avoidance. Among other things, we show that – the Thue-Morse word has the minimum possible subword complexity over all overlapfree binary words and all (73)-power-free binary words, but not over all ( 7 3) +-power-free binary words; – the twisted Thue-Morse word has the maximum possible sub...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1104.4425  شماره 

صفحات  -

تاریخ انتشار 2011